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A motion estimation and compensation technique is provided for interlaced digital 
video such as video object planes (VOPs). Predictor motion vectors for use In 
differentially encoding a current field coded macroblock (700) are obtained using 
the median of motion vectors (MV1, MV2, MVS) of sun^ounding blocks or 
macroblocks (712, 722, 732). When a sun-ounding macroblock (820) is itself 
interlaced coded, an average motion vector for that macrobloc 
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k is used, with fractional pixel values being mapped to the half-pixel. When the 
current block (912) is not interlaced coded but a sun-ounding block (820) is, the 
field motion vectors may be used individually or averaged. In a repetitive padding 

technique for an interlaced coded VOP (1,010), the even (0, 2 14) and odd 

(1, 3 15) lines of the VOP and sun-ounding block (1,000) are grouped. Within 

each field (1,210, 1,220), exterior pixels (1,6) are padded by setting them to the 
value of the nearest boundary pixel (0,6), or to an average of two boundary pixels 
(0,6; 1,7). The lines are then reordered to provide a single padded reference 
VOP image. 
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(54) Motion estimation and compensation of video object planes for interlaced digital video 



(57) A motion estimation and compensation tech- 
nique is provided for interlaced digital video such as 
video object planes (VOPs). Predictor motion vectors 
for use in differentially encoding a cun-ent field coded 
macroblock (700) are obtained using the median of 
motion vectors (MV1, MV2, MVS) of surrounding blocks 
or macroblocks (712, 722, 732). When a surrounding 
macroblock (820) is itself interlaced coded* an average 
motion vector for that macroblock is used, with fractional 
pixel values being mapped to the half-pixel. When the 
cunrent blodc (912) is not interlaced coded but a sur- 



rounding block (820) is, the f iekl motion vectors may be 
used individually or averaged. In a repetitive padding 
technique for an interlaced coded VOP (1.010), the 

even (0, 2 14) and odd (1 . 3 15) lines of the VOP 

and surrounding block (1 .000) are grouped. Within each 
field (1,210. 1.220), exterior pixels (1,6) are padded t>y 
setting them to tiie value of the nearest boundary pixel 
(0.6), or to an average of two boundary pixels (0.6; 1 ,7). 
The lines are then reordered to provide a single padded 
reference VOP image. 
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Description 

BACKGROUND OF THE iNVENTION 

5 The present invention relates to a method and apparatus for coding of digital video images such as video object 
planes (VOPs), and, in particular, to motion estimation and compensation techniques for interlaced digital vtdea A pad- 
ding technique for extending the area of an interlaced coded reference VOP is also disclosed. 

The invention is particularly suitable for use with various multimedia applications, and Is compatible with the MPEQ- 
4 Verification Model (VM) standard described in document ISO/IEC/JTC1/SC29/WG1 1 N1642. entitled "MPEG-4 Video 

10 Verification Model Version 7.0", April 1997, incorporated herein by reference. The MPEG-2 standard is a precursor to 
the MPEG-4 standard, and is described in document ISO/IEC 13818-2, entitled "Information Technology - Generic Cod- 
ing of Moving Pictures and Associated Audio, Recommendation H.262," March 25, 1994, incorporated herein by refer- 
ence. 

MPEG-4 is a new coding standard which provides a flexible framework and an open set of coding tools for commu- 
16 nication, access, and manipulation of digital audio-visual data. These tools support a wide range of features. The flexi- 
ble framework of MPEG-4 supports various combinations of coding tools and their corresponding functionalities for 
applications required by the computer, telecommunication, and entertainment (\.e„ TV and film) industries, such as 
database browsing, information retrieval, and interactive communications. 

MPEG-4 provides standardized core technologies allowing efficient storage, transmission and manipulation of 
20 video data in multimedia environments. MPEG-4 achieves efficient conpresslon, object scalability, spatial and temporal 
scalability, and en-or resilience. 

The MPEG-4 video VM coder/decoder (codec) is a block- and object-based hybrid coder with motton compensa- 
tion. Texture is encoded with an 8x8 Discrete Cosine Transformation (DCT) utilizing overlapped block-motion compen- 
sation. Object shapes are represented as alpha maps and encoded using a Content-based Aritiimetic Encoding (CAE) 
2S algorithm or a modified DCT coder, botii using temporal prediction. The coder can handle sprites as they are known 
from computer graphics. Other coding methods, such as wavelet and sprite coding, may also be used for special appli- 
cations. 

Motion compensated texture coding is a well known approach for video coding, and can be modeled as a three- 
stage process. The first stage is signal processing which includes motion estimation and compensation (ME/MC) and 

30 a two-dimensional (2-D) spatial transformation. The objective of ME/MC and the spatial transformation is to take advan- 
tage of tenporal and spatial congelations in a video sequence to optimize tiie rate^listortion performance of quantization 
and entropy coding under a complexity constraint The most common technkfue for ME/MC has been block matching, 
and the most common spatial transformation has been the DCT. 

However, special concerns arise for ME/MC of VOPs, particularly vjhen tiie VOP is itself interlaced coded, and/or 

35 uses reference images which are interlaced coded. Moreover, for arbitrarily shaped VOPs which are interlaced coded, 
special attention must be paid to tine area of tiie reference image used for motion prediction. 

Accordingly, it would be desirable to have an efficient technique for ME/MC coding of a VOP which is itself inter- 
laced coded, and/or uses reference images which are interlaced coded. The technique should provide differential 
encoding of the motion vectors of a block or macroblock of tiie VOP using motion vectors of neighboring blocks or mac- 

40 roblocks. A corresponding decoder should be provided. It would further be desirable to have an efficient technique for 
padding the area of a reference image for coding of interlaced VOPs. The present invention provkJes a system having 
tiie above and other advantages. 

SUMIiMARY OF THE INVENTION 

In accordance with the present invention, a method and apparatus are presented for motion estimation and motion 
compensation coding of a video object plane (VOP) or similar video Image which is itself Interlaced coded, and/or uses 
reference images which are interlaced coded. 

A first method provides horizontal and vertical motion vector components for use in differentially encoding respec- 
so tive horizontal and vertical motion vector components of first and second fields of a current field coded macroblock of a 
digital video image. Candidate first, second and tiiird blocks near tiie current macroblock have associated horizontal 
and vertical motion vector components which can be used for predicting the motion vectors of the cunrent macrobtock. 
The first block immediately precedes tiie current macroblock of a current row, the second block is immediately above 
the current macroblock in a preceding row, and the third block immediately follows the second block in the preceding 
ss row. Thus, the candidate blocks are in a spatial neighborhood of tiie cunrent macroblock. 

A horizontal motion vector component is selected for use in differentially encoding the horizontal motion vectCM* 
components of tiie first and second fields of tiie cunent field coded macroblock according to the median of the horizon- 
tal motion vector components of the first, second and third candidate blocks. Alternatively, an average or some other 
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weighted function may be used. A vertical nfiotion vector component is determined similarly. 

When one of the candidate blocks is a subset of a macroblocK the block which is closest to the upper left-hand por- 
tion of the cuaent maaoblock is used as the candidate block of that particular macroblock. For example, the candidate 
block may be an 8x8 block in a 1 6x1 6 macroblock. 
6 A second method provides horizontal and vertical motion vector components for use in differentially encoding hor- 
izontal and vertical motion vector components, respectively, of a current progressive-predicted or advanced-predicted 
block of a digital video image. A progressive predicted block may be a 1 6x16 macroblock. An advanced predictton bkx:k 
uses a combination of 8x8 motion compensation and overlapped block motion compensation. In either case, the current 
block is not interlaced coded. 

10 Candidate first, second and third blocks have associated horizontal and vertical motion vector component. If at 
least one of the candidate blocks is a field coded candidate macroblock having first and second f iekte, then the first and 
second fields each have corresponding horizontal and vertical motion vector components. A horizontal motion vector 
component is selected for use in differentially encoding the horizontal motion vector component of the cun^ent block 
according to a value derived from the horizontal mMon vector components of the first, second and third candkiate 

IS blocks. 

In particular, the selected horizontal motion vector component may be determined according to a median of the 
horizontal motion vector components of the candidate blocks, including the corresponding horizontal motion vector 
components of the first and second fields of the at least one field coded candidate macrobtock. 

Alternatively, the respective first and second field horizontal motion vector components of the at least one fieU 
20 coded candidate macroblock may be averaged to obtain at least one corresponding averaged horizontal motion vector 
component. The selected horizontal motion vector component Is then determined according to a median of the horizon- 
tal motion vector components of the candidate blocks other than^the at least one f iekJ coded candidate maaoUock. if 
any. and the at least one conresponding averaged horizontal motion vector component. 

For example, if all three candidate macroblocks are field G e., interlaced) predicted, the horizontal motion vector 
25 components of the first and second fields of each candidate macroblock are averaged to obtain three averaged horizon- 
tal motion vector components. The selected horizontal motion vector component for differentially encoding the horizon- 
tal motion vector component of the current block is then the median of the three averaged motion vector components. 
A vertical motion vector component is similariy selected. 

When first and second field motion vectors of tiie at least one field coded candidate macroblock are averaged, all 
30 fractional pixel offsets are mapped to a half-pixel displacement to provide a better predictkm. 

In a third method, the current macroblock is field predicted and at least one of the candidate blocks is a field coded 
macroblock. The selected horizontal motion vector component for use in differentially encoding the horizontal motion 
vector component of ttie first field of the current macroblock is determined according to a value derived from (i) the hor- 
izontal motion vector components of the candidate blocks other than the at least one field coded candidate macrobtock, 
35 if any, and (ii) tiie horizontal motion vector conrponents of the first field of the at least one field coded candidate mac- 
roblock. For exanple, the median may be used. Thus, only the first field of the field predicted candidate macroblock(8) 
is used. Alternatively, only the second field of the fiekl predicted candkiate macroblock(s) can be used to predict the 
second field of the cun^ent macroblock. 

In another alternative, the respective first and second field horizontal motion vector components of tiie at least one 
40 field coded candidate macroblock are averaged to obtain at least one conresponding averaged horizontal motion vector 
component. The selected horizontal motion vector component for use in differentially encoding the horizontal motion 
vector component(s) of at least one of the first and second fields of the cun'ent macroblock is determined according to 
a median of (i) tiie horizontal motion vectors of the candidate blocks otiier tiian tiie at least one field coded candidate 
macroblock. if any. and (iO tiie at least one conesponding averaged horizontal motion vector component A vertical 
45 motion vector component is similariy selected. 

When tiie first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged, all fractional pixel offsets are mapped to a half-pixel displacement 

A corresponding decoder method and apparatus are also presented. 

A method and apparatus are also presented for padding a digital video image which includes a field coded VOP 
so comprising interleaved top and bottom field pixel lines to provide a padded reference VOP. By padding tiie VOP, the 
image area is extended. The VOP is can-ied, at least in part, in a region which includes pixels which are extertor to 
boundary pixels of said VOP. The top and bottom field pixel lines are reordered from the interieaved order to provMe a 
top field block comprising the top field pixel lines, and a bottom field block comprising the bottom field pixel lines. The 
exterior pixels are padded separately within tiie respective top and bottom field blocks. 
65 After the exterior pixels have been padded, the top and bottom field pixel lines comprising tiie padded exterk3r pix- 
els are reordered back to tiie interleaved order to provide the padded reference image. 

During padding, when a particular one of the exterior pixels is located between two of the boundary pbcels of the 
VOP in ttie conresponding top or bottom f iekJ block, tiie exterior pixel is assigned a value according to an average of the 
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two boundary pixels. When a particular one of the exterior pixels is located between one of the boundary pixels of said 
VOP and an edge of the region in the conresponding field block, but not between two VOP boundary pixels in the cor- 
responding field blocK the exterior pixel is assigned a value according to one of the boundary pixels. The term 
"between" means bounded by interior pixels along a horizontal or vertical pixel grid line. For example, the region may 

5 be a 16x16 macroblock. 

When a particular exterior pixel is located between two edges of the region in the corresponding field blocK but not 
between a VOP boundary pixel and an edge of the region, and not between two of the VOP boundary pixels, the par- 
ticular exterior pixel is assigned a value according to at least one of: (a) a padded exterior pixel which Is closest to the 
particular exterior pixel moving horizontally in the region; and (b) a padded exterior pixel which is closest to the partic- 

10 ular exterior pixel moving vertically in the region. For example, when padded exterior pixels are available moving both 
horizontally and vertically from the particular exterior pixel In the region^ the average may be used. 

BRIEF DESCRIPTION OF THE DRAWINGS 

IS FIGURE 1 is an illustration of a video object plane (VOP) coding and decoding process In accordance with the 
present invention. 

FIGURE 2 is a block diagram of an encoder in accordance with the present invention. 
FIGURE 3 illustrates an interpolation scheme for a half-pixel search. 

FIGURE 4 illustrates a scheme for motion estimation with a restricted motion vector in accordance with tiie present 
20 inventtoa 

FIGURE 5 illustrates a scheme for motion estimation witii an unrestricted motion vector in accordance with tiie 
present Invention. 

FIGURE 6 illustrates reordering of pixel lines in an adaptive frameffiekJ prediction scheme in accordance with the 
present invention. 

25 FIGURE 7 illustrates a cunrent f iekJ nrKxJe macroblock with neighboring frame mode blocks having associated can- 
dkiate motion vector predictors in accordance with the present invention. 

FIGURE 8 illustrates a cunent field mode macroblock witii neighboring frame mode blocks and field mode macrob- 
lock having associated candidate motion vector predictors in accordance with the present invention. 

FIGURE 9 illustrates a current advanced prediction mode block with neighboring frame mode blocks and field mode 
30 macroblock having associated candidate motion vector predictors in accordance witii the present invention. 

FIGURE 10 illustrates macroblock-based VOP padding for nrxition prediction In accordance w'rth tiie present inven- 
tion. 

FIGURE 1 1 illustrates repetitive padding within a maaoblock for motion prediction in accordance witii the present 
Invention. 

35 FIGURE 12 illustrates repetitive padding witiiin a macroblock for motion prediction after grouping same-fiekJ pixel 
lines in accordance wittt the present invention. 

FIGURE 1 3 is a block diagram of a decoder in accordance wttii the present invention. 
FIGURE 14 illustrates a macroblock layer structure in accordance witii the present invention. 

40 DETAILED DESCRIPTION OF THE INVENTION 

A method and apparatus are presented for ME/MC coding of a VOP which is itself interlaced coded, and/or uses 
reference images which are interlaced coded, and for padding of the area of a reference image for coding of interlaced 
VOPs. 

45 FIGURE 1 is an illustration of a video object plane (VOP) coding and decoding process in accordance with the 
present invention. Frame 105 includes three pictorial elements, including a square foreground element 107, an oblong 
foreground element 108, and a landscape backdrop element 109. In fmme 115, the elements are designated VOPs 
using a segmentation mask such that VOP 117 represents tiie square foreground element 107, VOP 118 represents 
the oblong foreground element 108, and VOP 1 19 represents the landscape backdrop element 109. A VOP can have 

so an arbitrary shape, and a succession of VOPs is known as a video object. A full rectangular video frame may also be 
considered to be a VOR Thus, the term "VOP" will be used herein to indicate both arbitrary and non-arbitrary (e.g., rec- 
tangular) image area shapes. A segmentation mask is obtained using known techniques, and has a format similar to 
that of ITU-R 601 luminance data. Each pixel is identified as belonging to a certain region in the video frame. 

The frame 1 05 and VOP data from frame 1 1 5 are supplied to separate encoding functions. In particular, VOPs 1 1 7, 

55 1 18 and 1 19 undergo shape, motion and texture encoding at encoders 137, 138 and 139, respectively. Wrth shape cod- 
ing, binary and gray scale shape information is encoded. With motion coding, the shape information is coded using 
motion estimation witiiin a frame. With texture coding, a spatial transformation such as the OCT is performed to obtain 
transform coefficients which can be variable-length coded for compresston. 
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The coded VOP data is then combined at a multiplexer (MUX) 140 for transmission over a channel 145. Atterna- 
tively. the data may be stored on a recording medium. The received coded VOP data is separated by a demultiplexer 
(DEMUX) 150 so that the separate VOPs 117-119 are decoded and recovered. Frames 155, 165 and 175 show that 
VOPs 1 1 7, 1 1 8 and 1 1 9, respectively, have been decoded and recovered and can therefore be individually manipulated 

5 using a compositor 1 60 which Interfaces with a video library 1 70, for example. 

The compositor may be a device such as a personal computer which is located at a user's home to allow the user 
to edit the received data to provide a customized image. For example, the user*^ personal video library 1 70 may include 
a previously stored VOP 178 (e.g.. a circle) which is different than the received VOPs. The user may confipose a frame 
1 85 where the circular VOP 1 78 replaces the square VOP 1 1 7. The frame 1 85 thus includes the received VOPs 118 

10 and 119 and the locally stored VOP 178. 

In another example, the background VOP 109 may be replaced by a background of the user's choosing. For ernn- 
pie, when viewing a television news broadcast the announcer may be coded as a VOP which is separate from the back- 
ground, such as a news studio. The user may select a background from the library 170 or from another television 
program, such as a channel with stock price or weather infbrmatton. The user can therefore act as a video editor. 

IS The video library 170 nnay also store VOPs which are received via the channel 145. and may access VOPs and 
other image elements via a network such as the Internet. Generally, a video session comprises a single VOP, or a 
sequence of VOPs. 

The video object coding and decoding process of FIGURE 1 enables many entertainment business and educa- 
tional applications, including personal computer games, virtual environments, graphical user interfaces, vkJeooonfbr- 
20 encing, Internet applications and the like. In particular, the capability for ME/MC with interlaced coded (e.g., field mode) 
VOPs in accordance with the present invention provides even greater capabilities. 

FIGURE 2 is a block diagram of an encoder in accordance with the present invention. The encoder is suitable for 
use with both predictive-coded VOPs (P-VOPs) and bi-directionally coded VOPs (B-VOPs). 

P-VOPs may include a number of macroblocks which may be coded individually using an intra-frame mode or an 
2$ inter-frame mode. With intra-frame (INTRA) coding, the macroblock is coded without reference to another macrobtock. 
With inter-frame (INTER) coding, the macroblock is differentially coded with respect to a temporally subsequent frame 
in a mode known as fbnvard prediction. The temporally sufc>sequent frame is known as an anchor frame or reference 
frame. The anchor frame (e.g., VOP) must be a P-VOP. not a B-VOP. 

With forward prediction, the current macroblock is compared to a search area of macroblocks in the anchor frame 
30 to determine the best match. A corresponding motion vector describes the relative displacement of the current maaob- 
lock relative to the best match macroblock Additionally, an advanced prediction nrxxJe for P-VOPs may be used, where 
motion compensation is performed on 8x8 blocks rather than 16x16 macroblocks. Moreover, both intra-frame and inter- 
frame coded P-VOP macroblocks can be coded in a frame mode or a field mode. 

B-VOPs can use tiie forward prediction mode as described above in connection with P-VOPs as well as backward 
35 prediction, bi-directional prediction, and direct mode, which are all inter-frame techniques. B-VOPs do not currently use 
intra-frame coded macroblocks under the MPEG-4 Video Verification Model Version 7.0 referred to previously, aWmjgh 
this is subject to change. The anchor frame (e.g.. VOP) must be a P-VOP. not a B-VOR 

Witii backward prediction of B-VOPs. the current macroblock is compared to a search area of macroblocks in a 
temporally previous anchor frame to determine the best match. A corresponding motion vector describes the relative 
40 displacement of the current macroblock relative to the best match macroblock. With bi-directional prediction of B-VOPs, 
the cunrent macroblock is compared to a search area of macroblocks in botii a temporally previous anchor frame and a 
temporally subsequent anchor frame to determine the best match. Fonvard and t)ackward motion vectors describes the 
relative displacement of tiie current macroblock relative to the best match macroblocks. 

Wrth direct mode prediction of B-VOPs. a motion vector is derived for an 8x8 block when the collocated macroblock 
45 in the following P-VOP uses the 8x8 advanced prediction mode. The motion vector of the 8x8 block in the P-VOP is lin- 
early scaled to derive a motion vector for the block in the B-VOP without the need for searching to find a best match 
block. 

The encoder, shown generally at 200, includes a shape coder 210, a motion estimation function 220, a motion com- 
pensation function 230. and a texture coder 240, which each receive video pixel data input at terminal 205. The motion 

so estimation function 220, motion compensation function 230, texture coder 240. and shape coder 210 also receive VOP 
shape Information Input at terminal 207, such as tiie MPEG-4 parameter VOP_of_arbitrary_shape. When this parame- 
ter is zero, the VOP has a rectangular shape, and the shape coder 210 therefore is not used. 

A reconstructed anchor VOP function 250 provides a reconstructed anchor VOP for use by the motion estimation 
function 220 and motion compensation function 230. For P-VOPs, the anchor VOP occurs after the current VOP in pres- 

55 entation order, and may be separated from ttie current VOP by one or more intermediate images. The cunrent VOP is 
subtracted from a motion compereated anchor VOP at subtracter 260 to provide a residue which is encoded at the tex- 
ture coder 240. The texture coder 240 performs the DOT to provide texture information (e.g., transform coefficients) to 
a multiplexer (MUX) 280. The texture coder 240 also provides information which is summed with the output from the 
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motion compensator 230 at a summer 270 for input to the reconstructed anchor VOP function 250. 

Motion information (e.g.. motion vectors) is provided from the motion estimation function 220 to the MUX 280. while 
shape information which indicates the shape of the VOP is provided from the shape coding function 210 to the MUX 
280. The MUX 280 provides a corresponding multiplexed data stream to a buffer 290 for subsequent communication 
5 over a data channel. 

The pixel data which is input to the encoder may have a YUV 42:0 format The VOP is represented by means of a 
bounding rectangle. The top left coordinate of the bounding rectangle Is rounded to the nearest even number not 
greater than the top left coordinates of the tightest rectangle. Accordingly, the top left coordinate of the bounding rec- 
tangle in the chrominance component is one-half that of the luminance component. 

10 FIGURE 3 illustrates an interpolation scheme for a half-pixel search. Motion estimation and motfon compensation 
(ME/MC) generally involve matching a block of a current video frame (e.g.. a cunrent block) with a block in a search area 
of a reference frame (e.g., a predicted block or reference block). The reference frame(8) may be separated from tiie cur* 
rent frame by one or more intermediate images. The displacement of the reference block relative to tiie current block is 
the motion vector (MV), which has horizontal (x) and vertical componenta Positive values of tiie MV components 

IS indicate that the predicted block is to tiie right of, and below, the current block. 

A motion compensated difference block is formed by subtracting ttie pixel values of the predicted block from those 
of the cunrent block point by point. Texture coding is tiien performed on the difference block. The coded MV and the 
coded texture information of the difference block are transmitted to the decoder. The decoder can then reconstruct an 
approximated current block by adding the quantized difference block to the predicted block according to tiie MV. The 

20 block for METMC can be a 1 6x1 6 frame block (macroblock), an 8x8 frame block or a 1 6x8 f ieU block. 

In ME/MC, it is generally desirable to have small residue values for the difference block, use few bits tor the motion 
vectors, and have a low computational complexity. Due to its lower computational complexity relative to ottier difference 
measures, the Sum of Absolute Difference (SAD) is commonly used to select the motion vector which meets tiiese cri- 
teria, as follows. Let {c(/.y),/V o0.1,....A/-1} be the pixels of the cunrent block and 

2s {p(/n.n).m,nB-fl,-R+1 -1,0,1 H+/V-1 } be tiie pixels In the search range of the reference frama Then 



30 



(JlI.\c(i.j)-p(iJ)\}-C, ifCx,y) = (0.0) 



M jmO 



I,I,\<^iJ)'-p(i+xJ+y)\, otherwise 



35 where {x.ys-f?.-/?+i....,-i ,0,1 /?-1 }, R and C are positive constants. 

The (x. y) pair resulting in the minimum SAD value is the optimum full-pixel motion vector (MV) having horizontal 
and vertical components, e.g.. (MV^^MVy). Note that a MV of (0.0) is favored because a positive constant C is sut>- 
tracted from tiie SAD when (x, y) » (0,0). For example, for a 16x16 block. 0^128. Accordingly, tiie distribution of MVs 
can be concentrated near (0, 0) so tiiat entropy coding of the MVs is more efffoient 

40 Accuracy of (MVj^.MVy) is set at half-pixel. Interpolation must be used on tiie anchor frame so tiiat p(/+x.y+y) is 
defined for x or y being half of an integer. Interpolation is performed as shown in FIGURE 3. Integer pixel positions are 
represented by the symbol V. as shown at A. B. 0 and D. Half-pixel positions are indicated by circles, as shown at a. 
b, c and d. As seen, a « A , b = (A + B)//2 c = (A + C)/y2 ,andde(A-i-B + C + D)//4 , where "/T denotes rounded divi- 
sion, as discussed below in connection witii Tables 2 and 3. 

45 A motion compensated difference block is defined as d(/.y>c(/,y>p(/+M\^jf./+/Wl/^),/,y=0,1,..., A/-1 . The differ- 
ence block d{ij) is transformed, quantized, and entropy coded. At a decoder, the motion vector {MVj^,MVy) and quan- 
tized difference block {d (/.y)} are available to reconstiiict tiie cun'ent frame as fblfows: 



50 



55 



For a color format of Y:U:V = 4:2:0. the macroblock size is 16x16 pixels for tiie Y (luminance) component, and 8x8 
pixels for the U/V (chrominance) components. The search range R On half-pixel units) can be selected by the user and 
is signified by a parameter called f_code, where fl={(x,y):-2 '-"^®^x,y<2 '-^*'^} , as shown in Table 1 . below. For 
example, for f_code=1, the motion vector components can assume values from -16 to +15.5. in half-pixel increments. 
For f_code=2. tiie motion vector components can assume values from -32 to +31.5 in half-pixel increments. 
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Table 1 



5 



10 



f_code 


R 


1 


32 


2 


64 


3 


128 


4 


256 



Motion vectors for the chrominance blocks are derived from those of the luminance blocks. Since one pixel in a 
chrominance block corresponds to two pixels in each direction in the conresponding luminance block, i.e., the chromi- 

75 nance component is at half the resolution of the luminance, the MV for the chrominance block is one-half the MV of the 
luminance block. Moreover, since the MV of the luminance block may have half-pixel values, the MV for the chronti- 
nance block may consequently have quarter pixel values. But. since only half-pixel interpolation is used for MC of the 
chrominance block, the quarter pixel values have to be rounded into half-pixel values. Table 2. below, shows how to per- 
form the required rounding operation. For example, 1/4 is.rounded to 1/2, 2/4 is the same as 1/2 and 3/4 Is rounded to 

20 1/2. 



Table 2 



25 



1/4 pixel value 


0 


1 


2 


3 


1/2 pixel value 


0 


1 


1 


1 



Although only the reconstructed previous frame is available at the decoder for MC. there is a choice at the encoder 
for ME to use either the reconstructed previous frame or the original previous frame, it is advantageous to use the orig- 
30 inal previous frame In ME. but not MC, since the complexity is lower, and the MV may represent true motion more 
closely 50 that the chrominance components may be more accurately predicted. 

For some video sequences, such as where there is fast motion or a scene change, coding the difference block may 
require more bits than direct DCT coding of the actual intensity values of the current block. Accordingly, It Is desirable 
to have a decision criteria for adaptively choosing to code the cun'ent block directly (e.g., INTRA mode) or differentially 
35 (e.g., INTER mode). The following parameters are calculated to make the INTRA/INTER decision: 

mean^^ S Z ^t'-^*) and /l« ^ Z MiJ>niean\. 
^ My«o /-oy-o 

40 

where /Vis the size of the block (e.g., N=16).The INTRA mode is chosen if /\<(SADin^^^/lf\^jf.M\/^)-2*/\/c) . othenwse. 
the INTER mode is used. Note that the MV in SAD[niei{MVx,MVy) for this dedsion is at integer pixel resolution. If the 
INTRA mode is chosen, no further operations are necessary for the motion search. If the INTER mode is chosen, the 

45 motion search continues for the half-pixel MV. 

FIGURE 4 illustrates a scheme for motion estimation with a restricted motion vector in accordance with the present 
invention. /Advanced ME/MC techniques Include unrestricted motion vector, advanced prediction, and bi-directional 
ME/MC. In the basic ME/MC technique, the predicted block is a block in the previous frame. However, if the current 
block is at a corner or border of the current frame, the range of the MV is restricted. An advanced technique is to allow 

so unrestricted MVs for such corner and border blocks. 

With this technique, the previous frame is extended in all four directions (e.g.. left, top, right, and bottom) by repeat- 
ing (e.g., padding) the border pixels a number of times according to a code word (e.g.. f_code. described above in TaUe 
1) which indicates the relative range of motion. With a larger range of motion, a correspondingly larger search area is 
required. The difference block is generated by applying ME/MC against the extended previous frame and taking thedif- 

S5 ference of the cunrent block and the predicted block that may be partially out of the frame boundary. This technique 
improves the coding efficiency of the boundary blocks and can result in an improved image. 

For example, with the basic ME/MC technique, a previous frame 400 includes a predicted block 410 which Is in a 
search range 420. The relative macrobtock (MB) position of the current frame is shown with a dashed line 430. The cor- 
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responding motion vector may be, for example, (MV^. MVy)-(8,0 ) if the predicted block is displaced eight pixels hori- 
zontally to the right and zero pixels vertically. 

FIGURE 5 illustrates a scheme for motion estimation with an unrestricted motion vector in accordance with the 
present invention. Like^numbered elements con^espond to the elements in FIGURE 4. With this advanced ME/MC tech- 
5 nique. the search range 520 can cross over the boundary of the previous frame 400 into an extended previous frame 
500. The corresponding motion vector may be, for example, (MVjj, MVy)=(8,-10) if the predicted block is displaced 
eight pixels horizontally to the right and ten pixels vertically upward, where vertically downward is take as the positive 
direction. 

FIGURE 6 illustrates reordering of pixel lines in an adaptive frame/field prediction scheme in accordance with the 

10 present invention. In a first aspect of the advanced prediction technique, an adaptive technique is used to deckle 
whether a current macroblock of 1&(16 pixels should be ME/MC coded as is, or divided into fbur blocks of 8x8 pixels 
each, where each 8x8 block is ME/MC coded separately, or whether field based motion estimation shouM be used, 
where pixel lines of the macroblock are reordered to group the same-f iekJ lines in two1 6x8 field blocks, and each 16x8 
block is separately ME/MC coded. 

16 A field mode image, e.g., a 1 6x1 6 macroblock, is shown generally at 600. The macroblock includes even-numbered 
lines 602. 604, 606, 608, 610. 612, 614 and 616, and odd-numbered lines 603, 605, 607, 609, 61 1, 613, 615 and 617. 
The even and odd lines are thus interleaved, and form top and bottom (or first and second) fields, respectively. 

When the pixel lines in image 600 are permuted to form same-field luminance btod®, the macroblock shown gen- 
erally at 650 is formed. Anows, shown generally at 645, indicate the reordering of the lines 602-617. For example, the 

20 even line 602, which is the first line of macroblock 600. is also the first line of maaoblock 650. The even line 604 is reor- 
dered as the second line in macroblock 650. Similarly, the even lines 606. 608, 61 0. 61 2, 61 4 and 61 6 are reordered as 
the third through eighth lines, respectively, of macroblock 650. Thus, a 16x8 luminance region 680 with even-nunidered 
lines is formed. Similarly, the odd-numbered lines 603. 605, 607, 609. 61 1 . 613, 615 and 617 form a 16x8 region 685. 
The decision process for choosing the MC mode for P-VOPs is as follows. For frame nrKxJe video, first obtain the 

2S Sum of Absolute Differences (SAD) for a single 1 6x1 6 block, e.g., SAD-^slf^Vx^MVy); and for four 8x8 blocks, e.g., 

SADQ(MV^^.MVy^).SAD^(MV^^.MVy2).SADQ(MV^^,MVy^). andSADQ(MV^^MVy^). 

If 

30 

4 

M 

choose 8x8 prediction: othenvlse, choose 16x16 predtebon. 

For interlaced video. obtain SADtop(MV^_top,MVy_top)SADtonomif^^x_bonom»MVy^bo^ where 
{M^x top*^^y top) anci {MV^ bottomMVy^tottom) are the motion vector for both top (even) and bottomlodd) fields. Then, 
chooie the reference field which has the smallest SAD (e.g., for SAD^^p and SADbottom) ^® ^i^'d half sanple 
40 search. 

The overall prediction mode decision Is based on choosing the minimum off: 

{a)SAD^s(MV^.MVy). 
(b) 

4S 

4 

50 and 

(c) SADtop(MV^^.MVy^iop)'^SADtc^JMVj^i^^ 

If term (a) is the minimum, 16x16 prediction is used. If term (b) is the minimum, 8x8 motion compensation (advanced 
prediction mode) is used. If term (c) is the minimum, f ieki based motion estimation is used. 
55 If 8x8 prediction is chosen, there are fbur MVs for the fbur 8x8 luminance blocks, i.e., one MV fbr each 8x8 block. 
The MV for the two chrominance blocks is then obtained by taking an average of these four MVs and dividing the aver- 
age value by two. Since each MV fbr the 8x8 luminance block has a half-pixel accuracy, the MV fbr the chrominance 
blocks may have a sixteenth pixel value. Table 3, below, specifies the conversion of a sixteenth pixel value to a half-pixel 
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' value for chrominance MVs. For example. 0 through 2/16 are rounded to 0, 3/16 through 13/16 are rounded to 1/2, and 
14/16 and 15/16 are rounded to 2/2ci1 . 



Tables 



1/16 pixel value 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


1/2 pixel value 


0 


0 


0 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


1 


2 


2 



10 With field prediction, there are two MVs for the two 16x8 blocks. The luminance prediction is generated as follows. 
The even lines of the niacroblock (e.g.. lines 602. 604. 606, 608, 610, 612, 614 and 616) are defined by the top fieid 
motion vector using the reference field specified. The motion vector is specified in frame coordinates such that full pixel 
vertical displacements correspond to even integral values of the vertical motion vector coordinate, and a half-pixel ver- 
tical displacement is denoted by odd integral values. When a half-pixel vertical offset is specified, only pixels from lines 

IS within the same reference field are combined. 

The MV for the two chrominance blocks is derived from the (luminance) motion vector by dividing each component 
by 2, then rounding as follows. The horizontal conponent is rounded by mapping all fractional values into a halfi^ixel 
offset This is the same procedure as described in Table 2. The vertical motion vector component Is an integer and the 
resulting chrominance motion vector vertical component is rounded to an integer. If the result of dividing by two yields 

20 a non-integral value, it is rounded to the adjacent odd integer. Note that the odd integral values denote vertical interpo- 
lation between lines of the same field. 

The second aspect of the advanced prediction technique is overlapped MC for luminance blocks. In the following 
discussion, four MVs are always assumed for a 16x16 luminance block. The case of one 16x16 MV can be considered 
as having four identical 8x8 MVs. Each pixel in an 8x8 luminance predicted block is a weighted sum of three prediction 

25 values specified in the following equation: 

PV.J>{HoV.j) • Q(M>H, V.j) • r(i.jy,H^{i,j) • s(/./))/8 

where division by eight is with rounding off to the nearest half-pixel, with rounding away from zero. The weighting matri- 
30 ces for Ho(/,y), H^C/jl and H2(/.y) are specified in Tables 4-6, respectively, below. (i,j)=(0,0) is the upper left hand value 
in each table, and (i,i)=(7,7) is the lower right hand corner value. 

In Table 5, the top four rows indicate the top neighbor motion vector weights, while the bottom four rows indicate 
the bottom neighbor motion vector weights. In Table 6. tiie four left-hand columns indicate the left-hand neighbor motion 
vector weights, while tiie four right-hand columns indicate the right-hand neighbor motion vector weights; 

35 



Table 4 



40 



45 



SO 



4 


5 


5 


5 


5 


5 


5 


4 


5 


5 


5 


5 


5 


5 


5 


5 


5 


5 


6 


6 


6 


6 


5 


5 


5 


5 


6 


6 


6 


6 


5 


5 


5 


5 


6 


6 


6 


6 


5 


5 


5 


5 


6 


6 


6 


6 


5 


5 


5 


5 


5 


5 


5 


5 


5 


5 


4 


5 


5 


5 


5 


5 


5 


4 


Tables 
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2 


2 


2 


2 


2 


2 


2 


1 


1 


2 


2 


2 


2 


1 


1 
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Table 5 (continued) 
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1 




1 


1 


1 


1 


1 








1 


1 


1 


1 


1 








1 


1 


1 


1 


1 








1 


1 


1 


1 


1 








2 


2 


2 


2 


1 




2 


2 


2 


2 


2 


2 


2 


2 



Tablee 
1 I 1 I 1 
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2S 



2 


1 


1 




1 


1 


1 


2 


2 


2 


1 




1 


1 


2 


2 


2 


2 


1 




1 


1 


2 


2 


2 


2 


1 




1 


1 


2 


2 


2 


2 


1 




1 


1 


2 


2 


2 


2 


1 




1 


1 


2 


2 


2 


2 


1 




1 


1 


2 


2 


2 


1 


1 




1 


1 


1 


2 



The values of are the pixels of the previous frame, defined as follows. 

as 

40 where (M\/x . Wy ) is the MV of the cun^ent 8x8 luminance block. (MVx ,MVy ) is the MV of the block either above (for 
j = 0. 1 .2,3) or below (for j = 4,5.6.7) the current block, and {MVl ,MVy ) is the MV of the block either to the left {for 
1=0,1 ,2.3) or right (for i = 4,5,6,7) of the current block. 

FvlGURE 7 illustrates a current field mode macroblock with neighboring frame mode blocks having associated can- 
didate motion vector predictors in accordance with the present invention. A P-VOP may be used. When using INTER 

4S mode coding, the motion vectors for the current image blocks must be transmitted. The motion vectors are coded dif- 
ferentially by using a spatial neighborhood of motion vectors which are already transmitted. That is, since data is proc- 
essed from top to bottom and from left to right aaoss an image in virtually all vMeo standards, motion vectors from 
blocks which are above and/or to the left of a current block are available for processing the current bkx^k. These motkm 
vectors are therefore candidate predictors for differential coding. 

so Motion vector coding is performed separately on the horizontal and vertical components of the cun^ent block. For 
each MV component in a P-VOP. for example, the median value of the candidate predictors for the same component 
may be computed, and the difference value between component and median values may be coded using variable 
length codes. 

When interlaced coding tools are used, candidate predictors for field-based motion vectors in a current P-VOP can 
55 be obtained as follows. Let macroblock 700 be a current field mode macroblock (e.g.. 16x16 pixels). Surrounding nnac- 
robiocks include a macroblock 710 which immediately precedes the current macroblock 700 in the current row 715, a 
macroblock 720 which is immediately above the current macroblock in a preceding row 725, and a macroblock 730 
which immediately follows the macroblock 720 in the preceding row. 
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The macroblock 700 has associated first field horizontal and vertical motion vectors MV^n and MVyi^. respectively, 
and second field horizontal and vertical motion vectors MVxf2 and MVyf2. respectively. Vertical and horizontal motion 
vector conponents are not shown separately in FIGURE 7 for simplicity. For example, assume the first field includes 
even-numbered rows, and the second field includes odd-numbered rows. Moreover, in the example shown, advanced 

5 prediction mode is used for macroblocks 710, 720 and 730, so that macrot^ock 710 includes an 8x8 candidate block 
712 with associated horizontal and vertical motion vectors MVI^ and MVI^ respectively, macroblock 720 includes an 
8x8 candidate block 722 with associated horizontal and vertical motion vector components MV2x and MV2y, respec- 
tively, and macroblock 730 includes an 8x8 candidate block 732 with associated horizontal and vertical motion vector 
components MV3x and MV3y, respectively However, it also possible for any or all of the candidate blocks 712, 722 and 

10 732 to be macroblocks, i.e., where advanced prediction mode is not used. None of the macroblocks 710. 720. 730 is 
field predicted in the present illustratton. 

When a particular 8x8 sub-block of a macroblock is used as a candidate, the macroblock will have three other sub- 
blocks with associated horizontal and vertical motion vector components which are suitable for use in differentisdiy 
encoding the motion vector components of the current field coded macroblock 700. Generally, is desirable to select the 

IB sub-block in the particular maaoblock which is closest to the upper left-hand portion of the cun-ent macroblock as the 
candidate block as shown. 

Predictor horizontal and vertical motion vector Components, Px and Py, respectively, can be determined in accord- 
ance with the present invention from Px=median (MVIx. MV2x, MV3x) and Pysmedian(MV1yi MV2y MV3y). It has been 
found that the use of the median provides efficient coding. The median is the middle number in an ordered sequence 

20 having an odd number of elements, or the average of the two middle numbers of a sequence having an even nunri^er 
of elements. For example, the median of the sequence (1, 2, 4) is 2, and the median of the sequence (1, 2, 4, 10) is 3. 
The median is the same as the average when there are two numbers in a sequence. 

Other functions besides the median may be used. For example, the average may be used, i.e.. 
Px-1/3MV1x + 1/3MV2x + 1>3MV3x. Alternatively, some other weighting scheme may be used, e.g., 

^ P x-0.4 MV1 X -f 0.4 MV2 x + 0.2 MV3 x - Moreover, while three candidate blocks are used in the present example, two 
or more may be used. Furthermore, the location of the candidate blocks may vary. For example, a candidate block (not 
shown) which immediately precedes macroblock 720 may be used. Moreover, for coding schemes which employ suffi- 
cient buffering capability, candidate blocks which follow the current macroblock 700 in the current row 715, or a subSB- 
quent row (not shown) may be used. 

30 For differential coding of the current macroblock to obtain a motion vector difference value, MVD, in accordance 
with the present invention, both fields use the same predictor. That is, MVDxfi'^^xfr^x* MVDy,^BMVy|^-Py, 

MVDxe=MVxe-Px.and MVDy,2«MVy,2-Py 

FIGURE 8 illustrates a cun^ent field mode macroblock with neighboring frame nfuxie blocks and field mode macrob- 
lock having associated candidate motion vector predictors in accordance with the present invention. A P-VOP may be 
35 used. Uke-numbered elements correspond to the elements in FIGURE 7. Vertical and horizontal motion vector compo- 
nents are not shown separately in FIGURE 8 for sirTiplicity. Here, candidate nrmcroblock 820 which is immediately above 
thee current macroblock 700 in the previous row 725 is field coded. Thus, macroblock 820 has associated first and sec- 
ond field vectors, including horizontal motion vector components MV2xfi and MV2xi2. and vertical motion vector com- 
ponents, MV2yf^ and MV2yi2. 

40 Generally, when the cunrent macroblock is field predicted, and at least one of the spatial neighborhood macroblocks 
is field predicted, then the candidate motion vector predictors can be generated by using the same field of the candidate 
blocks. That is. for the first field of the current macroblock, the first field motion vector(s) of the surrounding field pre- 
dicted macroblock(s) are used. Similarly, for the second field of the cun'ent macroblodi. the second field motion vec- 
tor(s) of the surrounding field predicted macroblock(s) are used. 

45 Specifically, the first field horizontal predictor is Pxii=median(MV1x, MV2xfi, MVSJ, the first field vertical predictor 
is Pyf^smedian (MVIy. MV2yfi, MV3y). the second field horizontal predictor is Pxf2-niedian(MV1x. MV2xf2. MV3x). and 
the second field vertical predictor is Pyf2=median(MV1y. MV2yf2. MV3y). The motion vector difference values are 
MVD^,=MV^,- Px,! . MVDy,,=MVy,,.Py,, . MVDx,2=MVxK,-Px«>. and mo^^^m^- P^, 

Alternatively, the first and second field motion vectors of maaoblock 820 (and any other field mode candidate mac- 

60 roblock) may be averaged to obtain averaged horizontal and vertical motion vector components. The processing then 
proceeds as discussed in connection with FIGURE 7. Specifically, an averaged horizontal motion vector component for 
macroblock 820 is MV2x=(MV2xii+MV2xi2)/2 . while an averaged vertical motion vector component for macroblock 
820 is MV2y=(MV2y,i+MV2y^2)^. Predictor horizontal and vertical motion vector components, respectively, are 
Px=median(MV1x, MV2x, MVaj and Py=median(MV1y MV2y MV3y). The motion vector difference values for the cur- 

55 rent macroblock 700 are MVDx,i=MVx,i-Px. MVDy,i=MVy,i-Py. hA\/D^=hA\/^'F^,a3f)6 MVDy,2=MVy^-Py . 

When two or more of tiie candkJate macroblocks are field predicted, processing may proceed as above for eadtt 
field predicted macroblock. 

For coding efficiency, to ensure that the vertical component of a fieU motion vector is an integer, the vertical differ- 
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ence motion vector component is encoded in the bitstream as MVD y,i={MVy|^ -intCP y))/2 , where int (Py) means trun- 
cate Py in the direction of zero to the nearest integer. This assures that all fractional pixel offsets are mapped to a half- 
pixel displacement. For example, if MVyti=4 and Py=3.5. then MVDy|i=(MVyji-int(Py))/2=(4- int(3.5))^=(4-3)y2=:0.5 . 
Othenwise. without the "inf function, MVDy,^=(MVy,^-Py)y2=(4- 3.5)/2=(0.5)/2=0.25 , which cannot be coded as effi- 
5 ciently. The factor of 1/2 is used to reduce the magnitude of MVDyfi , thus making it more efficiently coded by the motion 
vector VLC. 

FIGURE 9 illustrates a current advanced prediction mode block with neighboring frame mode blocks and a field 
mode macrobtock having associated candidate motion vector predictors in accordance with the present invention. A P- 
VOP may be used. Uke-numbered elements con^espond to the elements in FIGURES 7 and 8. Vertical and horizontal 

10 motion vector components are not shown separately in FIGURE 9 for simplicity. The current block 91 2 in a cunrent mac- 
roblock 900 is shown as an advanced prediction 8x8 block. Alternatively, the cunrent block 912 may be a frame mode 
(progressive) macroblock which is the same as macrobtock 900. Recall that with advanced prediction mode, each of 
the four 8x8 blocks in a macroblock is ME/MC coded separately. 

Generally, when the current block Is coded as a progressive macroblock or an advanced predicb'on (8x8) block, and 

IS at least one of the coded spatial neighborhood macroblocks is field predicted, the candidate motion vector components 
can be generated by averaging the first and second field motion vector components, by using both of the first and sec- 
ond field motion vector components as candidates, or by using one of the first and second field motion vector compo- 
nents, but not both. Specifically, in a first option, the motion vector components of the interlaced candidate 
macroblock(s) are averaged within the block. For example, for macroblock 820, averaged motion vector components 

20 are MV2 x*(MV2 +MV2 y^)f2 , and MV2 y=(MV2 yj, +MV2 y|2)/2 . Predictor horizontal and vertical motion vector com- 
ponents, respectively, are Px=median(MV1x, MV2x, MV3x) and Py=median (MVIy, MV2y, MV3y). and the motion vector 
difference values for the cunrent block 912 are MVD ^^MV ^-P ^ and MVD y=MVy-P y . MVx and MVy are the horizontal 
and vertical motion vector components, respectively, of the current block 912. 

In a second option, both of the f iekl motion vectors of the macroblock 820 are candidate predictors. For example, 

25 for macroblock 820. the predictor horizontal and vertical motion vector components, Px and Py. respectively, can be 
determined from Px=median(MV1x, MV2xfi. MV2xf2. MVSJ and Py=median(MV1y MV2y,i, MV2y|2, MVy). and the 
motion vector difference values for the cunrent block 912 are MVD x-MV^^-P^^ and MVD y=MV y-P y . 

In a third option, the first f iekJ motion vectors of tiie macroblock 820 are candidate predictora For example, fbr mac- 
roblock 820, the predictor horizontal and vertical motion vector components, respectively, are Px=median(MV1x. 

30 MV2xfi. MV3x) and Py=median(MV1y MV2y<^, MVy). The motion vector difference values for the current block 912 are 
MVDx=MVx-Px and MVDy=MVy-Py. 

In a fourth option, the second field motion vectors of the macroblock 820 are candidate predictors. For example, for 
macroblock 820, the predictor horizontal and vertical motion vector components, respectively, are Px-median(MV1x, 
MV2xf2. MV3x) and Py:smedian(MV1y. MV2yi2. MVy). The motion vector difference values fbr the cun'ent block 912 are 

35 MVDx=MVx-Px andMVDy=MVy-Py. 

As discussed previously, when averaging pixel data from first and second fields, all fractional pixel offsets are 
mapped to a half-pixel displacement fbr coding efficiency. 

FIGURE 10 illustrates macroblock-based VOP padding fbr motion prediction in accordance with ttie present inven- 
tion. At an encoder, padding is used to Increase tiie area of a reference image fbr motion estimation prior to motion 

40 compensation. The technique Is particularly suited fbr use with arbitrarily shaped video object planes (VOPs). A mac- 
roblock can similarly be padded at a decoder as soon as tiie macroblock is reconstructed. For example, a star-shaped 
VOP 1 01 0 in a frame or otiier image region 1 000 nfiay be padded to fill blocks which are at tiie boundary of the VOP as 
well as other neighboring macroblocks. 

The blocks designated by light shading (e.g., such as block 1020) are all boundary blocks, and are processed with 

45 normal padding. Blocks designated with darker shading (e.g., such as block 1 030) are adjacent to the kx)undary blocks 
and are processed with extended padding. Further extended block (not shown) may also be padded. The amount of 
padding required is related to tiie parameter f.code discussed above in connection with Table 1. The blocks may be 
16x16 luminance blocks or 8x8 chrominance blocks. 

Padding fills the areas outside the VOP by repeating tiie boundary pixels of the VOR If a pixel outside tiie VOP can 

so be padded by the repetition of more than one boundary pixel, the average of those particular boundary pixels is used. 
Normal padding refers to padding within a boundary block. First, each horizontal line of the blocks in frame 1000 is 
scanned to provide continuous line segments which are either Interior to the VOP (e.g., including boundary pixels of the 
VOP), or exterior to the VOP. If the entire line in a block is interior to tiie VOP, no padding is performed. 

If there are both interior and exterior segments in a block, and the exterior segment is positioned between an inte- 

55 rior segment and the edge of tiie block, the pixels in the exterior segment are set to the interior segment pixel value at 
the boundary of the VOP which is closest to that particular exterior segment. For example, for the left to right sequence 
E1-E5 and 16-116 in a 16 pixel scan line, where "E" denotes and exterior pixel and "I" denotes an interior pixel. E1-E5 
are set to 16. For tiie sequence El -E5. 16-110 and El 1 -El 6, El -E5 are set to 16. and El 1 -E16 are set to 110. 
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If an exterior segment is between two interior segments, the exterior segment is filled with the average of the two 
boundary pixels of the interior segments. For example, for the sequence 11-15. E6-E10 and II M16. E6-E10 are set Id 
(15+11 1)/2. 

The above process is repeated for each horizontal and vertical scan line in each block. If a pixel can be padded by 
5 both horizontal and vertical boundary pixels, the average is used. 

It is possible for an exterior line segment to extend horizontally across a block without encountering an interior line 
segment In this case, for each pixel in the line segment, scan horizontally In both directions to find the closest padded 
exterior pixel. If there is a tie (i.e.. padded pixels to the right and left are equidistant from a cun^ent pixeQ. use the pixel 
to the left of the cun^ent pixel. 

10 Similarly, it is possible for an exterior line segment to extend vertically across a block without encountering an inte- 
rior line segment. In this case, for each pixel in the line segment, scan vertically in both directions to find the closest 
padded exterior pixel. If there is a tie (i e., padded pixels above and below are equidistant from a cun^ent pixel), use the 
pixel above the current pixel. The exterior pixel is then replaced by the average of the pixels found in the horizontal and 
vertical bi-directional scans. 

16 However, when the VOP is interlaced coded, a modified padding technique as desaibed below may be used. 

FIGURE 1 1 1llustrates repetitive padding within a macroblock for motion prediction in accordance with the present 
invention. A block such as a 16x16 luminance macroblock 1 100 is shown. Each pixel location is designated by an {i.j) 
coordinata The (i.j) coordinate system used in FIGURES 1 1 and 12 does not necessarily con'esponding to (i.D coordi- 
nates or variables used elsewhere in the description. For example, the upper left hand pixel is designated (0,0). E&tih 

20 column of pixels in the maaoblock 1 100 is numbered from 0 to 15. while each row is also numbered from 0 to 15. 

Pixel location which are shaded are part of a VOP. For example, pixels (0,6-15), (1,7-15), (2.8-15), (3,9-15), (4,10- 
15), (5.11-15). (6,12-15), (7.13-15). (8, 14 and 15), (9,15). (12.1^, (13,14 and ^S|. (14, 13-1 5) and (15, 12-1 5) are part 
of a VOR Unshaded (i.e., exterior) pixels are not part of the VOR Using the padding technique discussed in connection 
with FIGURE 10, the exterior pixels can be padded with boundary (e.g., interior) pixel values of the VOP. For example. 

25 exterior pixels (0,0-5) are set to VOP boundary pixel (0,6). Exterior pixel (1 .6) is set to the average of boundary pixels 
(0.6) and (1 .7). Exterior pixel (10,15) is set to the average of boundary pixels (9,15) and (12,15). Exterior pixel (9.14) is 
set to the average of twundary pixels (8,14) and (9.15). and so forth. 

FIGURE 12 Illustrates repetitive padding within a macroblock for motion prediction after grouping same-field pixel 
lines in accordance with the present invention. When VOPs are interlaced coded, a modified padding technique is need 

30 for luminance pixel values of a reference VOP used for ME/MC. In accordance with the present invention, the luminance 
pixel values are split into top and bottom fields, and padding is performed in each field separately. For example, as 
shown in block 1200. the even rows of pixel data from frame 1 100 of FIGURE 1 1 are split into a top field block 1210 
(e.g., VOPJop) and a bottom field block 1220 (e.g., VOP.bottom), each of which is 16x8. Column numbers, indicated 
by "i", extend from 0-15, while row nun*ers. indicated by "j". extend from 0. 2, 4 14, and from 1, 3, 5 15. 

3S Each pixel value in the block 1 200 can therefore be described by an (i.j) coordinate. 

Top field block 1210 includes rows 0, 2, 4, 6, 8, 10, 12 and 14, while bottom field 1220 block includes rows 1, 3. 5, 
7,9, 11. 13 and 15. 

Next, each line in the respective blocks is scanned horizontally to provide exterior and interior line segments as dis- 
cussed previously. For example, (0-8,14) and (13-15,14) are interior line segments, while (9-12,14) is an extertor line 
40 segment 

Repetitive padding is then applied separately for each field. f=br exanrple, in the top field block 1210, exterior pixels 
(0.0). (0.2) and (0.4) are set to the value of interior pixel (0.6). exterior pixel (9.14) is set to the average of the values of 
boundary pixels (8.14) and (13,14), exterior pixel (1,6) is set to the average values of boundary pixels (0,6) and (1,8), 
and so forth. In the bottom field block 1 220, exterior pixels (0,1), (0,3) and (0,5) are set to the value of interior pixel (0,^. 

45 Lastly, after padding, the two field blocks 1210 and 1220 are combined to form a single luminance padded ref^- 
ence VOP. That is, the lines are reordered in the interieaved order shown in FIGURE 11. 

FIGURE 13 is a block diagram of a decoder in accordance with the present invention. The decoder, shown gener- 
ally at 1300. can be used to receive and decode the encoded data signals transmitted from the encoder of FIGURE 2. 
The encoded video Image data and differentially encoded motion vector data are received at terminal 1340 and pro- 

so vided to a demultiplexer (D EM UX) 1 342. The encoded video image data is typically differentially encoded In DOT trans- 
form coefficients as a prediction en^or signal (e.g., residue): 

A shape decoding function 1344 processes the data when the VOP has an arbitrary shape to recover shape infbr- 
matlcn. which is, in turn, provided to a motion compensation function 1350 and a VOP reconstruction function 1352. A 
texture decoding function 1346 performs an inverse DOT on transform coefficients to recover residue information. For 

55 INTRA coded macroblocks, pixel information is recovered directly and provided to the VOP reconstruction function 
1352. For INTER coded blocks and macroblocks. the pixel information provided from tiie texture decoding function 1348 
to the reconstructed VOP function 1352 represents a residue between the current macroblock and a reference maaob- 
lock. 
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For INTER coded blocks and macroblocks. a motion decoding function 1348 processes the encoded motion vector 
data to recover the differential motion vectors and provide them to the motion compensation function 1350 and to a 
motion vector memory 1349, such as a RAM. The motion compensation function 1350 receives the differential motion 
vector data and determines a reference motion vector (e.g., motion vector predictor) in accordance with the present 

5 invention. The reference motion vector is obtained from one or more of the macroblocks which are in a spatial neigh- 
borhood of the current macroblock. 

For example, when the encoder provides a reference motion vector which is the median of three neighboring mac- 
roblocks. the motion compensation function 1350 must re*calculate the median motion vector components (e.g., hori- 
zontal and vertical), and sum the median components with the differential motion vector components of the current 

10 macroblock to obtain tiie full motion vector for the current macroblock. The motion compensation function may also 
need to have circuitry for averaging motion vector components of top and bottom f iekis of a field coded neighboring 
macroblock. 

Thus, tiie motion vector memory 1349 is required to store tiie full motion vectors of the neighboring macroblocks 
once these full motion vectors are determined. For example, using the scheme disclosed in FIGURES 7-9. the motion 

16 vectors of the macroblock which immediately precedes tiie current macroblock in the current row must be stored, along 
witii preceding row macroblocks which are directly above, and above and to the right, of tiie cun'ent macroblock For 
row by row processing of a video frame or VOP, the memory 1349 may need to store up to one row of motion vector 
data. This can be seen by noting that when the full motion vector for the current macroblock is determined, tills value 
must be stored for use in determining a subsequent reference motion vector since the cun'ent macroblock will be a 

20 neighboring macroblock in a preceding row when the next row is processed. Furthermore, when the neighkx>ring mac- 
roblocks are field coded, motion vector components for both top and bottom fields must be stored. 

Once the motion conrpensati'on function 1350 determines a full reference motton vector and sums it with the differ- 
ential motion vector of the current macroblock. the full motion vector of the cunrent macroblock is available. Accordingly, 
the motion compensation function 1350 can now retrieve anchor frame best match data from a VOP memory 1354, 

25 such as a RAM, and provide the anchor frame pixel data to the VOP reconstruction function to reconstruct the current 
macroblock. In accordance with the present invention, the image quality of tiie reconstructed macroblock is improve by 
using the full motion vectors of neighboring macroblocks to determine the reference motion vector. 

Padding may also be performed by the motion compensation function 1350. The retrieved best match data is added 
back to the pixel residue at the VOP reconstruction function 1352 to obtain tiie decoded cunrent macroblock or block. 

30 The reconstructed block is output as a video output signal and also provided to tiie VOP memory 1354 to provide new 
anchor frame data. Note that an appropriate video data buffering capability may be required depending on the frame 
transmission and presentation orders since tiie anchor frame for P-VOPs is tiie temporally future frame in presentation 
order. 

Generally, the decoder will perform the same steps as tiie encoder to determine tiie median or otiier value which 
35 was used as the motion vector predictor for the current VOP or block. Alternatively, when tiie motion vector predictor is 
the same as the motion vector of one of the candidate macroblocks, it is possible to transmit a code word to the decoder 
which designates the particular macroblock. For example, a code 00 may mean tiie motion vector of tiie previous block 
in the same row was used, a code 01 may mean the motion vector of tiie block above tiie cun'ent block in tiie previous 
row was used, and a code 1 0 may mean the motion vector of the next block in the previous row was used. In this case, 
40 the decoder can directly use tiie motion vector predictor of tiie designated macroblock and need not access tiie motion 
vectors of each of the candidate macroblocks to recalculate the motion vector predictor. A code of 1 1 may mean that 
the motion vector predictor is different than the motion vectors of all of the candidate macroblocks. so the decoder must 
recalculate the motion vector predictor. Those skilled in the art will appreciate that tiie necessary operations can be 
implemented in software, firmware or hardware. This processing can be implemented witii a relatively low cost and low 
45 complexity 

FIGURE 14 illustrates a macroblock layer structure in accordance with the present invention. The structure is suit- 
able for P-VOPs, and indicates the format of data received by tiie decoder. A first layer 1410 Includes fiekls 
first jshape.code, MVD_sh, CR. ST and BAG. A second layer 1430 includes fields COD and MCBPC. A third layer 1450 
includes fields ACjDredJlag. CBPY. DQUANT. Interlaced Jnformation. MVD. MVD2. MVD3 and MVD4. A fourth layer 

50 includes fields CODA. Alpha.ACpred JIag. CBPA, Alpha Block Data and Block Data. Each of tiie above fields is defined 
according to tiie MPEG-4 standard. 

The field Interlaced Jnformation in the third layer 1450 indicates whettier a macroblock is interlaced coded, and 
provides field motion vector reference data which informs tiie decoder of tiie coding mode of tiie current macroblock or 
block. The decoder uses tills information in calculating the motion vector for a current macroblock. For exanple. if the 

55 current macroblock is not interlaced coded but at least one of the reference macroblocks is. tiien the decoder will aver- 
age tiie motion vector components of each of the interlaced coded reference macroblocks for use in determining a ref- 
erence motion vector for the current macroblock. AKernativeiy. tiie motion vector from tiie top or bottom field in the 
reference macroblock is used, but not both. If the current macroblock is interlaced coded, then tiie decoder will know to 
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calculate a reference motion vector separately for each field. The coding mode may also designate which of the candi- 
date n^croblocks» if any. has a motion vector which is the same as the reference motion vector used in differentially 
encoding the motion vector of the current macroblock. 

The Interlacedjnformation field may be stored for subsequent use as required In the motion vector memory 1349 

5 or other memory in the decoder. 

The Interlacedjnformation field may also include a flag dctjype which indicates whether top and bottom field pixel 
lines in a field coded macroblock are reordered from the interleaved order, e.g., for padding. 

It will be appreciated that the an^ngement shown in FIGURE 14 is an example only and that various other anrange- 
ments for communicating the relevant information to tiie decoder will become apparent to tiiose skilled in the art. 

10 A bitstream syntax for use in accordance with the present invention is now described. MPEG-4 provides a video 
syntax with a dass hierarchy, including a Video Session (VS), Video Object (VO). Video Object Layer (VOL) or Texture 
Object Layer (SOL), Group of Video Object Plane, and at the bottom, a Video Object Plane. The Video Object Plane 
Layer syntax, set forth beiow. can indicate whetiier a current macroblock is interlaced coded in accordance with the 
present invention as shown below. The syntax which is not shaded is part of the present invention (e.g., interlaced; if 

IS (interlaced) and topjieldjirst) Here, the term "interlaced''=1 if the current macroblock is interlaced coded. The term 
"topjieldjirst" indicates that tiie top field of the current macroblock is to be processed first. Other terms are defined in 
the aforementioned "MPE&4 Video Verification Model Version 7.0". Only a portion of the conventional syntax is shown 
for conpactness, witti the omitted portions being designated by three vertically arranged dots. 
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A more detailed macroblock layer syntax in accordance with the present invention when the Interlaced Jnfornriation 
fieldsl is shown below. fieid_prediction=1 if the cunrent macroblock is interlaced predicted. The referenced field flags 
have a value of zero for the top field and a value one for the bottom field. For P*VOPs. when fieldjDredictionsi, two 
motion vector differences follow the syntax, with the top field motion vector followed by the bottom field motion vector. 

The syntax also accounts for B-VOPs. In particular, for B-VOPS, when f ield_prediction=1 , two or four motion vector 
differences are encoded. The order of motion vector differences for an interpolated macroblock is top field forward, bot- 
tom field fbnward, top field backward, and bottom field backward. For unidirectional interlaced prediction (e.g., fbnrard 
or backward only), the top field motion vector difference is followed by the bottom field motion vector difference. 
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Syntax 



No. o£ 
bits 



Format 



if 
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IS 



(interlaced) { 

if ( (rabtype == INTRA) | | 

(tnbtype INTRAJJ) 1 1 

(cbp !« 0)) 

datatype 

if ((P VOP && ((tnbtype == INTER) || 

(tnbtype «= INTER^Q) ) ) || 
(B_VOP && (rabtype != Directjtnode) ) ) { 
f ieTdjprediction 
if (fieldjprediction) { 
if (P_VOP 11 

(B_VOP && (tnbtype U Baclcward))) 
£orward_top_f ialdjref erencs 
f orwaxdJbottGaB_f leld_ref erence 



20 



SS 



(B-VOP && (tnbtype Forward)) { 
bac3cward_top_f ield_refereace 
baclcward bottom field reference 



1 
1 

1 
1 



uimbsf 



uinbsf 



uimbsf 
uimbsf 

uimbsf 
uimbsf 



30 Accordingly, it can be seen that the present invention provides a method and apparatus for coding of digital video 
images such as video object planes (VOPs). and. in particular, to motion estimation and compensation techniques for 
interlaced digital video. A technique for providing predictor motion vectors for use in differentially encoding a cunrent 
field predicted macroblock uses tiie median of motion vectors of surrounding blocks or macroblocks. When a surround- 
ing nf\acroblock is itself interlaced coded, an average motion vector for that macroblock can be used, witii fractional pixel 

55 values being mapped to the half-pixel. When tiie cunrent block is not interlaced coded but a surrounding block is, the 
field motion vectors of the surrounding block may be used individually or averaged. 

A decoder in accordance vi^ith the present invention uses a bitstream syntax to determine the coding mode of a 
macroblock, e.g., such as whether a macroblock is field coded. TTie decoder may store tiie coding mode of neighboring 
macroblocks for later usa 

40 In a repetitive padding technique for an interlaced coded VOR the top and bottom lines of the VOP and sunrounding 
block are grouped. Within each field, exterior pixels are padded by setting tiiem to tiie value of the nearest boundary 
pixel, or to an average of two boundary pixels. The lines are then reordered to provide a single preference VOP image. 

Although the invention has been described in connection witti various specific embodiments, tiiese skilled in the art 
will appreciate that numerous adaptations and modifications may be made thereto wittiout departing from tiie spirit and 

45 scope of tiie invention as set forth in tiie claims. 

Claims 



1. A metiiod for providing horizontal and vertical motion vector components for use in differentially encoding respec- 
so tive horizontal and vertical motion vector components of a current block of a digital video image, wherein: 

candidate first, second and third blocks have associated horizontal and vertical motion vector components; 
said first block being at least a portion of a first macroblock which immediately precedes sakJ cunrent block In 
a current row; 

55 said second block being at least a portion of a second macroblock which Is immediately above said cunrent 

block in a preceding row; 

said tiiird block being at least a portion of a third macroblock which Immediately follows said second maaob- 
lock in said preceding row; and 
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IS 



at least one of said first, second and third candidate blocks and said, current block is field-coded; 
said method comprising at least one of the steps of: 

(a) selecting a horizontal motion vector component for use in differentially encoding the horizontal motion 
vector component of said current block according to a value derived from the horizontal motion vector conv 
ponents of said first, second and third candidate blocks; and 

(b) selecting a vertical motion vector component for use in differentially encoding the vertical motion vector 
component of said current block-according to a value derived from the vertical motion vector components 
of said first, second and third candkJate blocks. 

The method of claim 1, wherein at least a particular one of said first, second and third macroblocks comprises a 
plurality of blocks, each of which have associated horizontal and vertical motion vector components suital)le for use 
in differentially encoding said motion vector components of said cun-ent blod^ said method comprising the further 
step of: 

selecting one of said plurality of blocks in said particular macroblock which is closest to an upper left-hand por- 
tion of sakJ current block as the candidate block of said particular macroblock. 



3. Tlie method of claim 1 or 2. comprising the further step of: 

20 

providing data for transmission with said at least one differentially encoded motion vector component Indicating 
whether said cun'ent block is field coded. 

4. Themethodof one of the preceding claims, wherein: 

2S 

said current block is a progressive-coded or advanced prediction-coded block; and 

at least one of said candidate blocks is a field coded candidate macroblock having first and second f ietols. each 

w'rth respective horizontal and vertical motion vector componenta 

30 5. The method of claim 4, wherein: 

said horizontal motion vector component selecting step makes a selection according to a median of the tiori- 
zontal motion vector components of said candidate blocks, including said conresponding horizontal motion vec- 
tor components of said first and second fields of said at least one f iekl coded candidate macroblock; and 
35 sakil vertical motion vector component selecting step makes a selection according to a median of the vertical 

motion vector components of said candidate t}locks, including said corresponding vertical motion vector com- 
ponents of said first and second fields of sakJ at least one f iekj coded candidate macrobk)Gk. 

6. The method of claim 4 or 5. comprising the further steps of: 

40 

averaging the respective first and second fiekJ horizontal motion vector components of the at least one field 
coded candidate macroblock to obtain at least one conresponding averaged horizontal motion vector compo- 
nent; 

wherein said horizontal motion vector component selecting step makes a selection according to a 

45 median of the horizontal motion vector components of said candidate blocks otiier than said at least one f ieU 

coded candidate macroblock, if any. and said at least one conresponding averaged horizontal motion vector 
component; and 

averaging the respective first and second field vertical motion vector components of the at least one field coded 
candidate macroblock to obtain at least one corresponding averaged vertical motion vector component; 
so wherein: 

said vertical motion vector component selecting step makes a selection according to a median of the vertical 
motion vector components of said candidate blocks other than said at least one field coded candidate macrob- 
lock, if any, and said at least one conresponding averaged vertical motion vector component. 

65 7. The method of claim 6, wherein: 

die first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged such tiiat all fractional pixel offsets are mapped to a half-pixel displacement; and 
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the first and second field vertical motion vector coniponents of tiie at least one field coded candidate macrob- 
lock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement 

8. The method of one of claims 1 to 3. wherein said current block is a field coded macroblock having first and second 
fields, each with respective horizontal and vertical motion vector components, said method comprising the further 
steps of: 

selecting a horizontal motion vector component for use In differentially encoding said horizontal motion vector 
components of said first and second fields of said current field coded macroblock according to a value derived 
from the horizontal motion vector components of said first, second and tiiird candidate blocks; and 
selecting a vertical motion vector component for use in differentially encoding said vertical motion vector com- 
ponents of said first and second fields of said current field coded macroblock according to a value derived from 
the vertical motion vector components of said first, second and ttiird candidate blocks. 

9. The method of claim 8, wherein at least one of said candidate blocks is a field coded candidate macroblock having 
first and second fields, each with respective horizontal and vertical nfK)tion vector components. 

10. The metiiod of claim 9, wherein: 

said horizontal motion vector component electing st^ makes a selection for encoding said horizontal nfx>tion 
vector component of said first field of said current macroblock according to a median of (i) the horizontal motion 
vector conponents of said candidate blocl^ other tiian sakJ at least one field coded candidate macroblock. if 
any, and (ii) the horizontal motton vector conrtponents of the first fieki of said least one field coded candkiate 
macroblock; 

said horizontal motion vector component selecting step makes a selection fbr encoding sakJ horizontal motion 
vector component of said second field of said current macroblock according to a median of (i) the horizontal 
motion vector components of said candidate blocks otiier tiian said at least one field coded candidate macrob- 
lock. if any. and (ii) tiie horizontal motion vector components of the second field of said at least one field coded 
candidate macroblock; 

said vertical motion vector component selecting step makes a selection for encoding saU vertical nrxstion vector 
conponent of said first field of said cunent macroblock is determined according to a median of (i) the vertical 
motion vector components of said candidate blocks ottier tiian said at least one field coded candidate macrob- 
lock. if any. and (ii) the vertical motion vector components of tiie first f leM of sakJ least one field coded candi- 
date macroblock; and 

said vertical motion vector component selection step makes a selection fbr encoding said vertical nx)tion vector 
component of said second f ieki of said current macroblock is determined according to a median of (i) the ver- 
tical motion vector components of said candidate blocks otiier tiian said at least one field coded candidate 
macroblock, if any, and (ii) tiie vertical motion vector components of tine second f iekl of said at least one f ieU 
coded candidate macroblock. 

1 1 . The method of claim 9. comprising the furtiier steps of: 

averaging the respective first and second field horizontal motion vector components of the at least one fieU 
coded candidate macroblock to obtain at least one conresponding averaged horizontal motion vector compo- 
nent; wherein: 

the selected horizontal motion vector component for use in differentially encoding the horizontal motion vector 
component(s) of at least one of said first and second fieMs of said cun-ent macroblock is determined according 
to a median of (i) tiie horizontal motion vector components of said candidate blocks otiier than said at least one 
field coded candidate macroblock. if any, and (ii) said at least one conesponding averaged horizontal motion 
vector component; and 

averaging the respective first and second field vertical motion vector components of the at least one field coded 
candidate macroblock to obtain at least one corresponding averaged vertical motion vector component; 
wherein: 

tiie selected vertical motion vector component for use in differentially encoding the vertical motion vector com- 
ponent(s) of at least one of sakJ first and second fields of said cun^ent macroblock Is detemiined according to 
a median of (i) tiie vertical motion vector components of said candidate blocks other than said at least one fiekl 
coded candidate maaoblock, rf any, and (ii) said at least one corresponding averaged vertical motion vector 
component 
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12. The method of claim 1 1 , wherein: 

the first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement; and 
the first and second field vertical motion vector conponents of the at least one field coded candidate macrcb- 
lock are averaged such that all fractional pixel offsets are mapped to a haH-pixel displacement 

1 3. A method for decoding a current block of digital video image data, said cunent block having associated differentiany 
encoded horizontal and vertical motion vector components, comprising at least one of the steps of: 

(a) determining a horizontal motion vector component used in differentially encoding said horizontal motion 
vector component of said current block according to a value based on horizontal motton vector components of 
first, second and ttiird candidate blocks; and 

(b) determining a vertical motion vector component used in differentially encoding a vertical motion vector com- 
ponent of said current block according to a value based on vertical motion vector components of said first, sec- 
ond and third candidate blocks; wherein: 

said first block is at least a portion of a first macroblock which immediately precedes said cun^ent block in 
a cunent row; said second block is at least a portion of a second macroblock which is immediately above 
said current block in a preceding row; 

said third block is at least a portion of a tiiird macroblock which immediately follows said second macrob- 
lock in said preceding row; and at least one of said first, second and third candidate blocks and said current 
block Is field coded. 

14. The method of claim 13, wherein: said cunrent block is a progressive-coded or advanced prediction-coded block; 
and 

at least one of said candidate blocks is a field coded candidate macroblock having first and second f iekJs, each 
with respective horizontal and vertical motion vector components. 

15. The method of claim 14, wherein: 

the horizontal motion vector component used in encoding tiie horizontal motion vector component of said first 
field of said current macroblock is determined according to a median of (i) the horizontal motion vector compo- 
nents of said candidate blocks other tiian said at least one field coded candidate maaoblock, if any, and (IQ the 
horizontal motion vector components of the first field of said least one f ieM coded candidate macrobtock; 
the horizontal motion vector component used in encoding the horizontal motion vector component of said sec- 
ond field of said current macroblock is determined according to a median of (i) the horizontal motion vector 
components of said candidate blocks otiier than said at least one field coded candidate macroblocK if any, and 
(ii) the horizontal motion vector components of the second field of said at least one field coded candidate mac- 
roblock; 

the vertical motion vector component used in encoding the vertical motion vector component of said first field 
of said current macroblock is determined according to a median of (i) ttie vertical motton vector components of 
said candidate blocks other tiian said at least one field coded candidate macroblock, if any. and (ii) the vertical 
motion vector components of tiie first field of said least one field coded candidate macroblock; and 
the vertical motion vector component used in encoding the vertical motion vector component of said second 
field of said current macroblock is determined according to a median of 0) tiie vertical motion vector compo- 
nents of said candidate blocks other tiian said at least one field coded candidate macroblock, if any, and (ii) the 
vertical motion vector components of tfie second field of said at least one field coded candidate macroblock. 

1 6. The method of claim 1 4 or 1 5, conprising the further steps of: 

averaging the respective first and second field horizontal motion vector components of the at least one field 
coded candidate macroblock to obtain at least one conesponding averaged horizontal motion vector compo- 
nent; wherein: 

tiie horizontal motion vector component used in differentially encoding tiie horizontal motion vector compo- 
nent(s) of at least one of said first and second fields of said current macroblock is determined according to a 
median of (i) the horizontal motion vector conponents of said candidate blocks otiier tiian saki at least one f iekJ 
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coded candidate macroblock. if any, and (ii) said at least one corresponding averaged horizontal motion vector 
component; and 

averaging the respective first and second field vertical motion vector components of the at least one field coded 
candidate maaoblock to obtain at least one corresponding averaged vertical motion vector component; 
5 wherein: 

the vertical motion vector component used in differentially encoding the vertical motion vector component(8) of 
at least one of said first and second fields of said current macroblock is determined according to a median of 
(i) the vertical motion vector components of said candidate blocks other than said at least one field coded can- 
didate macroblock, if any. and (ii) sa'd at least one corresponding averaged vertical motion vector component. 

10 

1 7. The method of claim 1 6. wher^: 



the first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement; and 
IS the first and second field vertical motion vector components of the at least one field coded candidate macrob- 

lock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement 

18. The method of one of claims 13 to 17, v/herein said current block is a fieU coded macroblock having first and sec- 
ond fields, each with respective horizontal and vertical motion vector components, said method comprising the fur- 

20 ther steps of: 

determining a horizontal motion vector component used in differentially encoding said horizontal vntifkm vector 
components of said first and second fields of said cun^ent field coded macroblock according to a value based 
on the horizontal motion vector components of said first, second and third candidate blocks; and 
25 determining a vertical motion vector component used in differentially encoding said vertical motion vector com- 

ponents of said first and second fields of said current field coded macroblock according to a value based on 
the vertical motion vector components of said f irst, second and third candidate blocks. 

19. The metiiod of daim 18, wherein at least one of said candidate blocks is a fieki coded macroblock having first and 
30 second fields, each with respective horizontal and vertical motion vector components. 

20. The method of claim 19, wherein: 

tiie horizontal motion vector component used in differentially encoding said horizontal motion vector compo- 
36 nents of at least one of said first and second fields of said current field coded macroblock is determined accord- 

ing to a median of the horizontal motion vector components of said candidate blocks, including said 
conresponding horizontal motion vector components of said first and second fields of said at least one field 
coded candidate macroblock; and 

said vertical motion vector component used in differentially encoding sakl vertical motion vector components 
40 of at least one of said first and second fields of said current field coded maaoblock is determined according to 

a median of the vertical nation vector components of said candidate blocks, including said corresponding ver- 
tical motion vector components of said f rst and second fields of said at least one field coded candidate mac- 
roblock. 

45 21 . The metiiod of claim 1 9, comprising the further steps of: 

averaging the respective first and second field horizontal motion vector components of the at least one field 
coded candidate macroblock to obtain at least one correspondng averaged horizontal motions vector compo- 
nent; 

so wherein said horizontal motion vector component used in differentially encoding said horizontal motion 

vector components of said first and second fields of said current field coded macroblock is determined accord- 
ing to a median of tiie horizontal motion vector components of said candidate blocks otiier than said at least 
one field coded candidate macroblock, if any, and said at least one conresponding averaged horizontal motkm 
vector component; and 

£5 averaging the respective first and second field vertical motion vector oonnponents of the at least one field coded 

candidate macroblock to obtain at least one corresponding averaged vertical motion vector component; 
wherein: 

said vertical motion vector component used in differentially encoding said vertical motion vector components 
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of said first and second fields of said current field coded macroblock i$ determined according to a median of 
the vertical motion vector components of said candidate blocks other than said at least one field coded candl* 
date macroblocK if any, and said at least one corresponding averaged vertical motion vector component. 

6 22. The method of claim 21 , wherein: 



the first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement; and 
the first and second field vertical motion vector components of the at least one field coded candidate macrob- 
10 lock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement 

23. The method of one of claims 13 to 22, wherein: 

the horizontal motion vector component used in differentially encoding saki horizontal motion vector compo- 
16. nent of said current block is determined according to a median of the horizontal motion vector components ol 

said first, second and third candidate blocks; and 

the vertical motion vector component used in differentially encoding said vertical motion vecta component of 
said cun-ent block is determined according to a median of the vertical motion vector components of saki first, 
second and third candklate blocks. 

20 

24. The method of one of claims 13 to 23, comprising the further step of: 

decoding sakJ candidate first second and tiiird blocks to recover saki horizontal and vertical motion vector 
components associated therewith. 

25 

25. The method of one of claims 13 to 24, comprising the further step of: 

recovering transmitted data witti said at least one differentially encoded motion vector corrponent indicating 
whetiier said cunrent block is field coded. 

30 

26. The method of daim 25, wherein: 



said coding mode designates the motion vector components of one of sad candklate first, second and fliird 
blocks as being tiie same as tiie motion vector components used in differentially encoding the motion vector 
35 components of said current block. 

27. The method of one of claims 13 to 26, wherein said digital vkleo image comprises macroblocks which are proc- 
essed row by row, comprising the f urttier steps of: 

40 processing said second and third macroblocks in said preceding row to recover said horizontal and vertical 

motion vector components associated tiierewitti; 

storing sakJ horizontal and vertical motion vector components associated witti saki second and tiiird macrot>- 
locks; 

processing said first macroblock in said cunrent row to recover said horizontal and vertical motion vector oom- 

45 ponents associated tiierewith; and subsequently retrieving said horizontal and vertical motion vector compo- 

nents associated witii said second and tiiird macroblocks for use in said at least one of said determining steps. 

28. A method for padding a digital video image which includes a fiekl coded vkJeo object plane (VOP) comprising top 
and bottom field pixel lines carried in an interleaved order to provide a reference padded VOP, sad VOP being car- 

50 ried, at least in part, in a region which includes pixels which are exterior to boundary pixels of sad VOP, saki 
method comprising tiie steps of : 

processing said top and bottom field pixel lines from said interleaved order to provide a top field block compris- 
ing said top field pixel lines, and a bottom field block comprising sad bottom field pixel lines; and 
55 padding said exterior pixels separately within sad respective top and bottom fled blocks. 

29. The method of claim 28, comprising tiie further step of: 
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processing said top and bottom field pixel lines comprising said padded exterior pixels in said interleaved order. 
30. The method of claim 28 or 29. wherein: 

5 when a particular one of said exterior pixels is located between two of said boundary pixels of said VOP in the 

con^esponding field block, said padding step comprises the further step of: 

assigning said particular one of said exterior pixels a value according to an average of said two boundary pix- 
els. 

10 31. The method of claim 28 or 29, wherein: 

when a particular one of said exterior pixels is located between one of said boundary pixels of said VOP and 
an edge of said region in the corresponding field block, but not between two of said boundary pixels of said 
VOP in the corresponding field block, said padding step comprises the further step of: 
IS assigning said particular one of said exterior pixels a value according to said one of said boundary pixele. 

32. The method of claim 28 or 29. wherein: 

when a particular one of said exterior pixels is located between two edges of said region in the corresponding 
20 field blocK but not between one of said boundary pixels of said VOP and an edge of said region in the con'e* 

spending field blocK and not between two of said boundary pixels of said VOP in the conresponding field block, 
said padding step conrprises the further step of: 

assigning said particular one of said exterior pixels a value according to at least one of: 

25 (a) a padded exterior pixel which is closest to said particular one of said exterior pixels moving horizontally 

in sakJ region in the corresponding field block; and 

(b) a padded exterior pixel which is closest to said particular one of said exterior pixels moving vertically in 
said region in the corresponding field block. 

30 33. An apparatus for providing horizontal and vertical motion vector components for use in differentially encoding 
respective horizontal and vertical nfx>tion vector components of a current block of a digital video image, wherein: 

candidate first, second and third blocks have associated horizontal and vertical motion vector components; 
said first block being at least a portion of a first maaoblock which immediately precedes said cun^ent block in 
35 acun-entrow; 

said second block being at least a portion of a second macroblock which is Immediately above sakJ current 
block in a preceding row; 

said third block being at least a portion of a third macroblock which immediately follows said second maaob* 
lock in said preceding row; and 
40 at least one of said first, second and third candidate blocks and said oun^ent bfock is field coded; 

said apparati^ comprising at least one of: 

(a) means for selecting a horizontal motion vector component for use in differentially encoding the horizon- 
tal motion vector component of said cun^ent block according to a value derived from the horizontal motion 

45 vector components of said first, second and third candidate blocks; and 

(b) means for selecting a vertical motion vector component for use in differentially encoding the vertical 
motion vector component of said current block according to a value derived from the vertical motion vector 
components of said first, second and third candidate bfocks. 



so 34. The apparatus of claim 33, wherein at least a particular one of said first, second and third macroblocks comprises 
a plurality of blocks, each of which have associated horizontal and vertical motion vector components suitable for 
use in differentially encoding said motion vector components of said cunrent block, said apparatus further compris- 
ing: 

55 means for selecting one of said plurality of blocks in said particular macroblock which is closest to an upper left- 

hand portion of said current block as the candidate block of said particular macroblock. 



35. The apparatus of daim 33 or 34, further comprising: 
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means for providing data for transmission with said at least one differentially encoded motion vector compo- 
nent indicating whether said cunrent block is field coded. 

36. The apparatus of one of claims 33 to 35, wherein: 

said cun-ent block is a progressive-coded or advanced prediction-coded block; and 

at least one of said candidate blocks is a field coded candidate macroblock having first and second f iekJs, each 

with respective horizontal and vertical motion vector components. 

37. The apparatus of claim 36, wherein: 

said means for selecting a horizontal motion vector component makes a selection according to a median of the 
horizontal motion vector components of said candidate blocks, including said conresponding horizontal motion 
vector components of sakJ first and second fields of saM at least one fiekl coded candidate macroblock; and 
said means for selecting a vertical motion vector component makes a selection according to a median of the 
vertical motion vector components of said candidate blocks, including said conresponding vertical motion vec- 
tor components of saki first and second fields of said at least one fietel coded candidate macrobkxdc 

38. The apparatus of daim 36, further conprising: 

means for averaging the respective first and second field horizontal motion vector components of the at least 
one fiekJ coded candidate macroblock to obtain at least one conresponding averaged horizontal motion vector 
component; 

wherein said means for selecting a horizontal motion vector component makes a selection according to 
a median of the horizontal motion vector components of said candidate blocks other than sad at least one f ieU 
coded candidate macroblock, if any, and said at least one conresponding averaged horizontal motion vector 
component; and 

means for averaging the respective first and second fiekl vertical motion vector components of the at least one 
field coded candidate macroblock to obtain at least one corresponding averaged vertical motion vector compo- 
nent; wherein: 

said means for selecting a vertical motion vector component makes a selection according to a median of the 
vertical motion vector components of said candidate blocks other than said at least one field coded candidate 
macroblock. if any, and said at least one corresponding averaged vertical motion vector component 

39. The apparatus of daim 38. wherein: 

the first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement; and 
the first and second field vertical motion vector components of the at least one field coded candidate macrob- 
lock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement 

40. The apparatus of one of claims 33 to 35, wherein: 

said cunrent block is a field coded macroblod^ having first and second fields, each with respective horizontal 
and vertical motion vector components; said horizontal motion vector component selecting means selects a 
horizontal motion vector component for use in differentially encoding said horizontal motion vector components 
of said first and second fields of saki current field coded macroblock according to a value derived from the hor- 
izontal motion vector components of said first, second and third candidate blocks; and 
said vertical motion vector component selecting means selects a vertical motion vector component for use in 
differentially encoding said vertical motion vector components of said first and second fields of saki cun-ent 
field coded macroblock according to a value derived from the vertical motion vector components of said first, 
second and third candidate blocks. 

41. The apparatus of claim 40, wherein: 

at least one of said candidate blocks is a field coded candidate macroblock having first and second fieMs, each 
with respective horizontal and vertical motion vector components. 
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42. The apparatus of claim 41 , wherein: 

said means for selecting a horizontal motion vector component makes a selection for encoding said horizontal 
motion vector component of said first field of said cun-ent macroblock according to a median of 0) the horizontal 
motion vector components of said candidate btocte other than said at least one field coded candidate macrob- 
lock, if any. and (ii) the horizontal motion vector components of the first field of said least one field coded can- 
didate macroblock; 

said means for selecting a horizontal motion vector component makes a selection for encoding said horizontal 
motion vector component of said second field of said current macroblock according to a median of (i) the hor- 
izontal motion vector components of said candidate blocks other than said at least one field coded candidate 
macroblock, if any, and (ii) the horizontal motfon vector components of the second field of said at least one f ieU 
coded candidate macroblock: 

said means for selecting a vertical motion vector component makes a selection for encoding said vertical 
motion vector component of said first field of said current macroblock according to a median of (0 the vertical 
motion vector components of said candidate btocte other than sakJ at least one field coded candidate maaob- 
lock, if any, and (ii) the vertical motion vector components of the first fleM of sakJ least one field coded candi- 
date macroblock; and 

said means for selecting a vertical motion vector component makes a selection for encoding said vertical 
motion vector compon^t of said second field of sakJ current macroblock according to a median of (i) the ver- 
tical motion vector components of said candidate blocks other than said at least one field coded candidate 
macroblock, if any, and (IQ the vertical motion vector components of the second field of said at least one field 
coded candidate macroblock. 

43. The apparatus of daim 41 or 42, further comprising: 

means for averaging the respective first and second field horizontal motion vector components of the at least 
one field coded candidate macroblock to obtain at least one corresponding averaged horizontal motion vector 
component; wherein: the selected horizontal motion vector component for use in differentially encoding the 
horizontal motion vector component(s) of at least one of said first and second fields of said current macroblock 
is detemiihed according to a median of (i) the horizontal motion vector components of said candidate blocte 
other than said at least one field coded candidate macroblock. if any, and (ii) said at least one corresponding 
averaged horizontal motion vector component; and 

means for averaging the respective first and second f ieU vertical motion vector components of the at least one 
field coded candidate macroblock to obtain at least one conresponding averaged vertical motion vector compo- 
nent; wherein: 

the selected vertical motion vector component for use in differentially encoding the vertical motion vector com- 
ponent(s) of at least one of said first and second fields of said cun^ent macroblock is determined according to 
a median of (i) ttie vertical motion vector components of said candidate blocks other than said at least one field 
coded candidate macroblocK if any. and (ii) said at least one corresponding averaged vertical motfon vector 
component 

44. The apparatus of daim 43, wherein: 

the first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement; and 
tiie first and second field vertical motion vector components of tiie at least one field coded candidate macrob- 
lock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement 

45. An apparatus for decoding a cun^ent block of digital video image data, said cun^ent block, having associated diffor- 
entially encoded horizontal and vertical motion vector components, comprising at least one of: 

(a) means for determining a horizontal motion vector component used in differentially encoding said horizontal 
motion vector component of said cunrent block according to a value based on horizontal motion vector compo- 
nents of first second and third cand'date blocks; and 

(b) means for determining a vertical motion vector component used in differentially encoding a vertical motion 
vector component of said current block according to a value based on vertical motion vector components of 
said first, second and tiiird candidate blocks; wherein: 
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said first block is at least a Portion of a first macroblock which immediately precedes said current block in 
a current roMr; 

said second block is at least a portion of a second macroblock which is immediately above said current 
block in a preceding row; 

s said third block is at least a portion of a third maaoblock which immediately follows said second macrob- 

lock in said preceding row; and at least one of said first, second and third candidate blocks and said cunrent 
block is field coded. 



46. The apparatus of daim 45. wherein: 

10 

said cunrent block is a progressiv&coded or advanced prediction-coded block; and 

at least one of said candidate blocks is a field coded candidate macroblock having first and second fiekJs. each 

with respective horizontal and vertical motion vector components. 



15 47. The apparatus of daim 46. wherein: 



the horizontal motion vector conponent used in encoding the horizontal motion vector component of said first 
f iekl of said cunrent macroblock is determined according to a median of (i) the horizontal motion vector compo- 
nents of said candidate blocks other than said at least one field coded candidate macroblocK if any. and (ii) the 

20 horizontal motion vector components of the first field of said least one field coded candidate macrobtock; 

the horizontal motion vector component used in encoding the horizontal motion vector component of saki sec- 
ond field of said current macroblock Is determined according to a median of 0) the horizontal motion vector 
conponents of said candidate blocks other than said at least one field coded candidate macroblocK if any, and 
Cii) the horizontal motion vector components of the second field of said at least one field coded candidate mac- 

25 roUock; 

the vertical motion vector component used in encoding the vertical motion vector component of said first f ieki 
of said cun-ent macroblock is determined according to a median of (i) the vertical motion vector components of 
said candidate blocks other than said at least one field coded candidate macroblock. if any. and (ii) the vertical 
motion vector components of the first field of said least one field coded candidate macroblock; and the vertical 
30 motion vector component used in encoding the vertical motion vector component of said second f leM of sakJ 

current macroblock is determined according to a median of (i) the vertical motion vector components of said 
candidate blocks other than said at least one field coded candidate macroblocK if any. and (ii) the vertical 
motion vector components of tiie second field of said at least one f ieU coded candidate macroblock. 

35 48. The apparatus of daim 46, further comprising: 



means for averaging the respective first and second field horizontal motion vector components of the at least 
one field coded candidate macroblock to obtain at least one con^esponding averaged horizontal motion vector 
component; wherein: the horizontal motion vector component used in differentially encoding the horizontal 
40 motion vector component(s) of at least one of said first and second fields of said current macroblock is deter- 

mined according to a median of (i) the horizontal motion vector components of said candidate blocks other than 
said at least one field coded candidate macroblocK ff any. and (ii) said at least one corresponding averaged 
horizontal motion vector component; and 

means for averaging the respective first and second field vertical motion vector components of the at least one 
45 field coded candidate macroblock to obtain at least one corresponding averaged vertical motion vector compo- 

nent; wherein: the vertical motion vector component used in differentially encoding the vertical motion vector 
component(s) of at least one of sakJ first and second fields of said curent macroblock is determined according 
to a median of (i) the vertical motion vector components of said candidate blocks other tiian said at least one 
field coded candidate macroblocK if any, and (ii) said at least one corresponding averaged vertical motion vec- 
50 tor component 



49. The apparatus of daim 48, wherein: 



the first and second field horizontal motion vector components of the at least one field coded candidate mac- 
55 roblock are averaged such ttiat all fracti'onal pixel offsets are mapped to a half-pixel displacement; and 

the first and second field vertical motion vector components of the at least one field coded candidate macrob- 
lock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement 
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50. The apparatus of claim 45, wherein said current block is a field coded macroblock having first and second fields, 
each with respective horizontal and vertical motion vector components, said apparatus further comprising: 

means for determining a horizontal motion vector component used in differentially encoding said horizontal 
motion vector components of said first and second fields of Said current field coded macroblock according to 
a value based on the horizontal motion vector components of said first, second and third candidate blocks; and 
means for determining a vertical motion vector component used in differentially encoding said vertical mofion 
vector components of said first and second fields of said cunrent field coded macroblock according to a value 
based on the vertical motion vector components of said first, second and third candidate blocks. 

51 . The apparatus of claim 50, wherein at least one of said candidate blocks is a field coded macroblock having first 
and second f iekJs. each with respective horizontal and vertical motion vector components. 

52. The apparatus of claim 51 , wherein: 

the horizontal motion vector component used in differentially encoding said horizontal motion vector compo- 
nents of at least one of said first and second fields of said current field coded maaoblock is determined accord- 
ing to a median of the horizontal motion vector components of said candidate blocks, including said 
conesponding horizontal motion vector components of said first and second fields of said at least one fieM 
coded candidate macroblock; and 

said vertical motion vector component used in differentially encoding said vertical motion vector components 
of at least one of said first and second fields of said current field coded macroblock is determined according to 
a median of the vertical motion vector components of said candidate blocks, including said conesponding ver- 
tical motion vector components of said first and second fields of said at least one field coded candidate mac- 
roblock. 

53. The apparatus of claim 51 or 52, further comprising: 

means for averaging tiie respective first and second field horizontal motion vector components of the at least 
one field coded candidate macroblock to obtain at least one con^esponding averaged horizontal motion vector 
component: 

wherein said horizontal motion vector component used in differentially encoding said horizontal motion 
vector components of said first and second fields of said current field coded maaoblock Is determined accord- 
ing to a median of tiie horizontal motion vector components of said candidate blocks otiier than said at least 
one field coded candidate macroblock, if any, and said at least one conresponding averaged horizontal motion 
vector component; and 

means for averaging the respective first and second field vertical motion vector components of tfie at least one 
field coded candidate macroblock to obtain at least one corresponding averaged vertical motion vector compo- 
nent; wherein: 

said vertical motion vector component used in differentially encoding said vertical motion vector components 
of said first and second fields of said current field coded macroblock is determined according to a median of 
ttie vertical motion vector components of said candidate blocks other than said at least one field coded candi- 
date macroblock, if any. and said at least one conresponding averaged vertical motion vector component. 

54. The apparatus of daim 53, wherein: 

ttie first and second field horizontal motion vector components of the at least one field coded candidate mac- 
roblock are averaged such that all fractional pixel offsets are mapped to a half-pixel displacement; and 
the first and second field vertical motion vector components of tiie at least one field coded candidate macrob- 
lock are averaged such that all fractional pixel offeets are mapped to a half-pixel displacement 

55. The apparatus of one of claims 45 to 52. wherein: 

the horizontal motion vector component used in differentially encoding said horizontal motion vector compo- 
nent of said cunrent block Is determined according to a median of the horizontal motion vector components of 
said first, second and tiiird candidate blocks; and 

the vertical motion vector component used in differentially encoding said vertical motion vector component of 
said cunrent block is, determined according to a median of the vertical motion vector components of said first, 
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second and third candidate blocks. 

56. The apparatus of one of claims 45 to 55, further 
comprising: 

a decoder for decoding said candidate first 
second and third blocks to recover said hori- 
zontal and vertical motion vector components 
associated therewith. 

57. The apparatus of one of claims 45 to 56, furthv 
comprishig: 

means for recovering transmitted data with said 
at least one differentially encoded motion vec- 
tor component indicating whether said cun^ent 
block is field coded. 

58. The apparatus of daim 57, wherein: 

said coding mode designates the motion vector 
components of one of said candidate farst sec- 
ond and third blocks as being the same as the 
motion vector conponents used in differentially 
encoding the motion vector components of said 
cun^ent block. 

59. The apparatus of one of claims 45 to 58, wherein 
said digital video image conrprises macrdblocks 
which are processed row by row, further compris- 
ing: 

means for processing said second and third 
macroblocks in said preceding row to recover 
said horizontal and vertical motion vector com- 
ponents associated therewith: 
means for storing said horizontal and vertical 
motion vector components associated with 
said second and third macroblockB; 
means for processing said first macroblock in 
said cun'ent row to recover said horizontal and 
vertical motion vector components associated 
therewith; and 

means for subsequently retrieving said hori* 
zontal and vertical motion vector components 
associated with said second and third macrob- 
locks for use by said at least one of saM means 
for determining. 

60. An apparatus for padding a digital video Image 
which includes a field coded video object plane 
(VOP) comprising top and bottom field pixel lines 
canried in an interleaved order to provide a refer- 
ence padded VOP, said VOP being carried, at least 
in part, in a region which includes pixels which are 
exterior to boundary pixels of said VOP, said appa- 
ratus comprising: 



means for processing said top and.bottom f iekJ 
pixel lines from said interleaved order to pro- 
vide a top field block comprising said top fieU 
pixel lines, and a bottom field block comprising 
said bottom field pixel lines; and 
means for padding said exterior pixels sepa- 
rately within said respective top and bottom 
fieU blocks. 
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